Name of Faculty Adviser Signature of Faculty Advisor Date
نویسنده
چکیده
Word sense discrimination is the problem of identifying different contexts that refer to the same meaning of an ambiguous word. For example, given multiple contexts that include the word ’sharp’, we would hope to discriminate between those that refer to an intellectual sharpness versus those that refer to a cutting sharpness. Our methodology is based on the strong contextual hypothesis of Miller and Charles (1991), which states that ”two words are semantically related to the extent that their contextual representations are similar.” This thesis presents corpus–based unsupervised solutions that automatically group together contextually similar instances of a word as observed in a raw text. We do not utilize any manually created or maintained knowledge–rich resources such as dictionaries, thesauri or annotated corpora. As a result, our approach is well suited to the fluid and dynamic nature of word meanings. It is also portable to different domains and languages, and scales easily to larger samples of text. The overall objective of this thesis is to study the effect of various feature types, context representations and clustering methods on the accuracy of sense discrimination. We also apply dimensionality reduction techniques to capture conceptual similarities among the contexts and don’t just rely on the surface forms of words in the text. We present a systematic comparison of various discrimination techniques proposed by Pedersen and Bruce (1997) and Schutze (1998). We find that the first order method of Pedersen and Bruce performs well with larger amounts of text, but that the second order method of Schutze is more effective with smaller data sets. We also discovered that a divisive approach is more suitable for clustering smaller set of contexts, while the agglomerative method performs better on larger data. We conducted experiments to study the effect of using various sources of training, and found that local contexts of a word provide better discrimination features than a running text like complete newspaper articles. We compared the performance of our knowledge–lean method against that of a knowledge– intense approach, and found that although the latter was successful in conjunction with smaller datasets, it didn’t show significant improvements with larger data. This suggests that the features learned from a large sample of text certainly have the potential to outperform those learned from a knowledge-rich resource like dictionary.
منابع مشابه
THE PLURIFORMITY OF THE ALEXANDRIAN TEXT-TYPE IN THE CATHOLIC EPISTLES Name of researcher: Coster Shimbaba Munyengwe Name of faculty adviser:
متن کامل
Cochlear Implantation
1 TITLE: Cochlear Implantation SOURCE: Grand Rounds Presentation, Department of Otolaryngology The University of Texas Medical Branch (UTMB Health) DATE: October 29, 2012 RESIDENT PHYSICIAN: Joseph L. Russell , MD FACULTY PHYSICIAN ADVISOR: Dayton Young, MD FACULTY PHYSICIAN ADVISOR: Tomoko Makishima, MD, PhD DISCUSSANT: Dayton Young , MD SERIES EDITOR: Francis B. Quinn, Jr., MD ARCHIVIST: Meli...
متن کاملGravel Grant Proposal Simulator for Social Agent Behavior
My research interest is social behavior in computer agents. I will extend the functionality of the simulator for discrete games with interagent communication I have written, use it to develop socially aware agents, and then attempt to integrate those agents into an environment where they can interact with human players. Applicant Name: Steven Damer Applicant Signature: Faculty Mentor Name: Mari...
متن کاملEffect of the discipline of formal faculty advisors on medical student experience and career interest.
OBJECTIVE To examine whether the discipline (family medicine vs other specialty) of formally assigned faculty advisors affected medical student experience and career interest. DESIGN Survey. SETTING University of Calgary in Alberta. PARTICIPANTS A total of 104 medical students from the graduating class of 2011. MAIN OUTCOME MEASURES Number of times medical students met with their adviso...
متن کامل